Compositionality in (high-dimensional) space
نویسنده
چکیده
Formal semanticists have developed sophisticated compositional theories of sentential meaning, paying a lot of attention to those grammatical words (determiners, logical connectives, etc.) that constitute the functional scaffolding of sentences. Corpus-based computational linguists, on the other hand, have developed powerful distributional methods to induce representations of the meaning of contentrich words (nouns, verbs, etc.), typically discarding the functional scaffolding as "stop words". Since we do not communicate by logical formulas, nor, Tarzan-style, by flat concatenation of content words, a satisfactory model of the semantics of natural language should strike a balance between the two approaches. In this talk, I will present some recent proposals that try to get the best of both worlds by adapting the classic view of compositionality as function application developed by formal semanticists to distributional models of meaning. I will present preliminary evidence of the effectiveness of these methods in scaling up to the phrasal and sentential domains, and discuss to what extent the representations of phrases and sentences we get out of compositional distributional semantics are related to what formal semanticists are trying to capture.
منابع مشابه
Non-Linear Similarity Learning for Compositionality
Many NLP applications rely on the existence of similarity measures over text data. Although word vector space models provide good similarity measures between words, phrasal and sentential similarities derived from composition of individual words remain as a difficult problem. In this paper, we propose a new method of of non-linear similarity learning for semantic compositionality. In this metho...
متن کاملیک روش مبتنی بر خوشهبندی سلسلهمراتبی تقسیمکننده جهت شاخصگذاری اطلاعات تصویری
It is conventional to use multi-dimensional indexing structures to accelerate search operations in content-based image retrieval systems. Many efforts have been done in order to develop multi-dimensional indexing structures so far. In most practical applications of image retrieval, high-dimensional feature vectors are required, but current multi-dimensional indexing structures lose their effici...
متن کاملInter-observer agreement between 2-dimensional CT versus 3-dimensional I-Space model in the Diagnosis of Occult Scaphoid Fractures
Background: The I-Space is a radiological imaging system in which Computed Tomography (CT)-scans can be evaluated as a three dimensional hologram. The aim of this study is to analyze the value of virtual reality (I-Space) in diagnosing acute occult scaphoid fractures. Methods: A convenient cohort of 24 patients with a CT-scan from prior studies, without a scaphoid fracture on radiograph, ye...
متن کاملConstructing Two-Dimensional Multi-Wavelet for Solving Two-Dimensional Fredholm Integral Equations
In this paper, a two-dimensional multi-wavelet is constructed in terms of Chebyshev polynomials. The constructed multi-wavelet is an orthonormal basis for space. By discretizing two-dimensional Fredholm integral equation reduce to a algebraic system. The obtained system is solved by the Galerkin method in the subspace of by using two-dimensional multi-wavelet bases. Because the bases of subs...
متن کاملSupervised Feature Extraction of Face Images for Improvement of Recognition Accuracy
Dimensionality reduction methods transform or select a low dimensional feature space to efficiently represent the original high dimensional feature space of data. Feature reduction techniques are an important step in many pattern recognition problems in different fields especially in analyzing of high dimensional data. Hyperspectral images are acquired by remote sensors and human face images ar...
متن کاملDetecting Compositionality of Multi-Word Expressions using Nearest Neighbours in Vector Space Models
We present a novel unsupervised approach to detecting the compositionality of multi-word expressions. We compute the compositionality of a phrase through substituting the constituent words with their “neighbours” in a semantic vector space and averaging over the distance between the original phrase and the substituted neighbour phrases. Several methods of obtaining neighbours are presented. The...
متن کامل